A Romanian Corpus for Speech Perception and Automatic Speech Recognition

نویسندگان

  • AHSANUL KABIR
  • MIRCEA GIURGIU
چکیده

A speech corpus is available in Romanian to use as the common material in speech perception and automatic speech recognition. It consists of high-quality audio of 400 sentences spoken by each of 12 speakers. Utterances are simple, syntactically identical phrases such as “muta bronz cu p 2 agale.” Preliminary intelligibility tests using the audio signals suggest that the collected speech is easily identifiable in quiet and low levels of noise. The corpus is annotated at the phoneme, syllable and word level and is available on the website for research use. Key-Words: Romanian Speech Corpus, Speech Intelligibility, Speaker Intelligibility

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

متن کامل

The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate

This paper first introduces a newly-recorded high quality Romanian speech corpus designed for speech synthesis, called “RSS”, along with Romanian front-end text processing modules and HMM-based synthetic voices built from the corpus. All of these are now freely available for academic use in order to promote Romanian speech technology research. The RSS corpus comprises 3500 training sentences an...

متن کامل

A Historically Perspective of Speaker-independent Speech Recognition in Romanian Language

In this paper we present our teamwork main results in the area of the automatic speech recognition and understanding in Romanian language using hidden Markov models (HMM), artificial neural networks (multilayer perceptron, support vector machines, Kohonen networks) and hybrid models (fuzzy HMM, fuzzy multilayer perceptron, HMM/multilayer perceptron) for different small Romanian language corpora...

متن کامل

Correlation between Auditory Spectral Resolution and Speech Perception in Children with Cochlear Implants

Background: Variability in speech performance is a major concern for children with cochlear implants (CIs). Spectral resolution is an important acoustic component in speech perception. Considerable variability and limitations of spectral resolution in children with CIs may lead to individual differences in speech performance. The aim of this study was to assess the correlation between auditory ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011